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ABSTRACT 

The Alpert-Haber Achievement Anxiety Test was 
developed to measure the extent to which individuals experience test 
anxiety. In at least two published studies, the authors claim to have 
used the test, when in fact the response format was changed from that 
used in the original instrument and the **buffer" items were emitted. 
To investigate the possibility that these seemingly minor alterations 
might change the statistical characteristics of the Achievement 
Anxiety Test, 104 Ss were randomly assigned to each of the three 
versions of the Achievement Anxiety Test--the original and the two 
"revisions." An analysis of covariance revealed significant 
differences between the three forms on all three possible scales: 
Facilitative, Debilitative, and Facilitative-minus-Debilitati ve. The 
three versions of the Achievement Anxiety Test also differed in terms 
of intro-form correlation between the Facilitative and Debilitative 
scales and internal consistency reliability estimates. Also of 
interest is the finding that, within each of the three forms, the 
negative correlation between Facilitative and Debilitative scales 
caused the Facilitative-minus-Debilicative scale to yield a higher 
reliability coefficient than either the Facilitative scale or the 
Dei)ilitative scale. (Author) 
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THE EFFECT OF VARYING THE RESPONSE FUIu'L\T OK THE 
STATISTICAL CHARACTERISTICS OF THE ALPERT- 
HABER ACHIEVEMENT ANXIETY TEST 
Abs tract 

The Alpert-Haber Achievement Anxiety Test was developed to measure the ex- 
tent to which indiviuuals experience test anxiety. In at least two published 
studies, the authors claim to have used the test when in fact the response for- 
mat was changed fron that used in the original instrument and the "buffer'* items 
were omitted. To investigate the possibility that these seemingly minor alter- 
ations might change the statistical characteristics of the Achievement Anxiety 
Test, IDA _Ss were randomly assigned to each of the three versions of the Achieve- 
ment Anxiety Test — the original and the two "revisions/' An analysis of co- 
variance revealed significant differences between the three forms on all three 
possible scales: Facilitative, Debilitative , and Facilitative-minus-Debilita- 
tive. The three versions of the Achievement i\nxiety Test also differed in terms 
of intra-form correlation between the Facilitative and Debilitative scales and 
internal consistency reliability estimates. Also of interest is the finding 
that, within each of the 'iLree forms, the negative correlation between Facili- 
tative and ^'ebili tative scales caused the Facilitative-minus-Debilitative scale 
to yield a higher reliability coefficient than either the Facilitative scale or 
the Debilitative scale. 
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THE EFFECT OF VARYING THE RESPONSE FORMAT ON THE 
STATISTICAL CHARACTERISTICS OF THE ALPERT- 
HABER ACH I FAKEMENT ANXIETY TEST 



\jj Edward J. Jacko and Schuyler W» Huck 

The University of Tennessee 
In conducting original research investigations, an investigator must ask him- 
self whether he has correctly and thoroughly described both the independent and 
dependent variables* This is imperative if others are to attempt replication of 
the study or utilization of the measuring devices. Also, when a researcher states 
/3 that a particular study was repeated or a specific measuring instrument was used, 
no alterations should be made without a thorough detailing of these changes. Al- 
though most writers wou}d agree that this goes without saying, violations are quite 
evident throughout the published literature. 

The Alpert-Haber Achievement Anxiety Test (Alpert & Iiaber, 1960) is a case in 
point. The Achievement Anxiety Test was developed to indicate the extent to which 
an individual experiences anxiety in academic achievement test situations and 
whether the anxiety has a facilitative or debilitative effect on test performance. 
The instrument contained 28 items: nine constituted the *^f acilitative" scale, 10 
th? debilitative" scale, and the remaining nine items were "buffer items." Each 
itc:a v;-:..^ made up of a statement and five possible responses listed vertically, as 
iii rr.alt iplc-choice format. For example, the 16th item or. the original Achievement 
Anxiety Test appeared as follows: 

I look forward to exams. 

(a) Never 

(b) Hardly ever 

(c) Sometimes 

(d) Usually 

(e) Always 

O 'hroughout the original Achievcm.ent Anxiet> Test, the responses were tailored to 
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each statenionL and thus the phraseology of the rerponses varied acrofss the 28-item 
inventory. The person taking the scale was asked to respond to each statement by 
selecting one of the five responses which most described his feelings. 

A study by Smouse and Munz (1969) indic:Ued that individuals stereotyped on 
the basis of the Achievement Anxiety Test perform differently on academic achieve- 
ment tests depending on item difficulty sequencing. However, the basic Achievement 
Anxiety Test, as devised by Alpert and Haber, was not used. Instead, only 19 of 
the items constituted the modified version, the buffer items being eliminated. More 
importantly, the format for responding involved a five-point horizontal continuum 
rather than the "multiple-choice" format. Although the end points of the continuum 
were verbally defined with the same words or phrases that were used for the first 
and last responses from the original Achievement Anxiety Test, the intermediate three 
points on the continuum were left undefined. Thus, item 16 from the original Achieve- 
ment Anxiety Test appeared in the Smouse and Munz version as follows: 

I look forward to exams . 

12 3 4 5 

I I I I I 

Never Always 

These changes to the Achievement Anxiety Test were not mentioned as such in the pub- 

lisl>ed articles. 

Walsh used still another variation of the Achievement Anxiety Test in three re- 
cent studies. His investigations involved the relationship between test anxiety and 
(a) extroversion, neuroticism, and optimism (Walsh, 1968a), (b) classroom test per- 
formance (Walsh, Engbretson, & O'Brien, 1968b), and (c) psychological needs, as 
measured by the Edwards Personal Preference Schedule (Walsh, 1969). His modified 
Achievement Anxiety Test utilized a Likert-typo response format for each statement 
with the phraseology of the five response options being constant across all items. 
For example, item 16 from the original Achievement Anxiety Test appeared in the 

ERLC 



4 



Walsh version as follows: 

R S F G A 
I look forward to exams. 

The directions explained that K = rarely, S = sometimes, Y = frequently, G = gen- 
erally, and A = almost always. On this version of the Achievement Anxiety Test, 
the buffer items again were omitted. As was the case with Smouse and Munz, Walsh 
did not indicate these changes to the original Achievement Anxiety Test. 

Several additional investigators (Bauer, 1971; Hermans, Laak, & Maes, 1972; 
Laxter, Quarter, Koom^an, & Walker, 1969; Laxter L Walker, 1970; Wittmaier, 1972) 
have conducted studies in which it is stated that the Alper *:-Haber Achievem.ent 
Anxiety Test was used. In light: of the above, merely menti-^ning the use of the 
Achievement Anxiety Test leaves us in doubt as to whether the basic instrument is 
actually being utilized. Of course, if the different versions of the Achievement 
Anxiety Test produce similar results, then we need not be concerned. 

With the thought that variations in tae response format might cause examinees 
to respond differentially to the Achievement Anxiety Test, the present study was 
undertaken to compare the three forms of the Achievement Anxiety Test described 
above. More specifically, comparisons were made in terms of mean score and reli- 
abilities for the three possible scores that could be computed from the Achievement 
Anxiety Test: Facilitative , Debilitative, and Facilitativc-minus-Debilitative . In 
addition, the three forms were compared in term.s of intra-form correlation between 
the facilitative and debilitative scores. Any significant differences would ob- 
viously negate, generalizinr^ that all forms of the Achievement Anxiety Test would 
equally measure test anxiety and would provide substantiation for the requirement 
of correct and thorough description of all variables in published studies. 

M:-THOl 

Subjects 

Tlie subjects (_Ss) for this experimental study consisted of all undergraduate 
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students enrolled in the nino st*ctioiu> of an adolescent p:;ycholoi^y cout\se offered 
at Tiie University of Tcnne.sseu durinr. the fall acadonic term, 1972. A total of 
312 Ss were enrolled in these classes, with an average class size of approximately 
35 students. 
Procedure 

Prior to the day of the experiment, 312 packets of material were assembled by 
t'.ie i nve^ t i^a t cn-s . The first itei i in each packet was a single sheet of paper request- 
ing the student to rate himself in terms of how his test performance was affected 
by test anxiety. The instructions asked the student to indicate whether test anxiety 
prevented him from doing well on tests or helped him to achieve high exam scores, 
and a -5 to 4-5 scale was provided for the self rating. This scale was carefully de- 
fined such that -5 would indicate the maximum amount of negative test anxiety, a 0 
would indicate no test anxiety at all, and a 4-5 would indicate the maximum amount 
of positive test anxiety. The "self rating" sheet was identical for all 312 packets. 

The second item in each packet was an envelope containing one of the three 
forms of the Achievement Anxiety Test. Except for necessary modifications in the 
instructions to achieve uniformity, these three forms of the Achievement Anxiety Test 
were reproduced exactly as they had been used in previous research studies. This 
meant that tlie three forms were identical in terms of the wording of the statements 
to which the S^s responded, and dissimilar in terms of the response format and the 
inclusion of the buffer itcrr.s on the original 1960 form. 

The actual experiment was conducted on the first day of the academic term. In 
each participating class, the regular instructor associated with that particular 
section of the adolescent psychology course distributed the packets to the students. 
Since the i>ackets h^ad previously been arranged in random order, one-third of each 
cla.ss was random.ly assi^»ned to a different form of the Achievement Anxiety Test. 
After eac]i student had received a packet, the instructor emphasized the importance 
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of compJ.cLiiui the "self rating" before openinf'^ the envelope. In addition, the r.tii- 
dents were encouraged to be honest in completing both items in their packet since 
no names were requested on the single sheet of paper (i.e., the "self rating") or 
the envelope or the i:i.it»'rlal inside the envelope. 
Statistica l Tj;;jl a_tr,(^nr ot I\i_ta 

For each of the S^s , three criterion nrieasures were derived from the S_* s re- 
sponses to the Achieverunt Anxiety Test. These were the Facili tat ive , the Debili- 
tative, and the Facilitative-minus-Debilitative scores. A separate one-way analysis 
of covariance was used to compare the three forms of the Achievement Anxiety Test 
with respect to each criterion measure, with scores from the "self rating" serving 
as the covariate in each analysis. Since three treatment groups were involved in 
these covariance analyses, Tukey * s post hoc multiple comparison test was used to 
compare the adjusted criterion means following an over-all significant F-ratio 
(Winer, 1971, p. 772). 

Prior to using the analysis of covariance, the assumption of homogeneous 
within-group regression slopes was tested. In accordance with Kirk's (1968) re- 
commendation, this assumption was tested at the .25 level of significance to pro- 
tect against the possibility of a Type II Error. The assumptions of normality and 
homogeneity of variance were not tested since the analysis of covariance, like the 
analysis of variance, is robust to violations of these assumptions under the con- 
dition of equal numbers of S^s in the treatment groups. 

The reliabilities for the Facilitative and Debilitative scales of each form 
of the Achievement Anxiety Test were estimated through an extension of the Hoyt 
analysis of variance reliability procedure (Hoyt & Stunkard, 1952). The reliability 
of the Facilitative-minus-Debilitative scores for each form of the instrument was 
also estimated through the use of Horst's (1966) formula for estimating the relia- 
bility of difference scores. 
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Finally, the correlation botveen the Faci]itative and Debilitative scores 
within each fora of the Achievement Anxiety Test was obtained. These three corre- 
lations were then compared with an appropriate chi square test for determining 
whether significant differences are present among k correlation coefficients, 
where k > 2. 

All major analyses of data were conducted at The University of Tennessee Coin- 
puter Center through the use of MANOVA and SAS library programs. For all inferen- 
tial statistical tests (except those dealing with the assumption of homogeneous re- 
gression slopes in the covariance analyses) » the present investigators made an a 
priori decision to use, and to report, the .05 level of significance. 

RESULTS 

Means and standard deviations on the "self rating" (the covariate) and the 
Facilitat ive , Debilitative, and Faciliative-minus-Debilitative scales of the Achieve- 
ipent Anxiety Test for each of the three treatment groups are presented in the first 
three columns of Table 1. Also presented in these columns are the adjusted means 
for each treatment groups on each scale. 



Insert Table 1 about here 



Prior to using the analysis of covariance to compare the three treatment groups 
on each scale of the Achievement Anxiety Test, a test was made to determine whether 
the assumption of homogeneous within-group regression slopes was tenable. The re- 
sults of these tests are presented in the fourth column of Table 1, and in each case 
the assumption was not rejected (p >.25). This outcome permitted the present inves- 
tigators to use the conventional covariance analysis in comparing the adjusted means. 

As indicated in the last column of Table 1, significant differences (p <.05) 
were found among the throe treatment groups on all three scales of the Achievement 
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Anxiety Test. With respect to the lacilltative scale, Tukey's multiple comparison 
test revealed significant differences (p * .05) between all pairwise comparisons. 
On the Debililative and Facilitative-minus-Debilitntive scales, Tukey's test indi- 
cated significant differences between Forms ifl and //2 and between Forms ill and //3 
but not between Forins iH and //3« 

The reliability estimates for the Facilitative , Debilitative , and Facilita t ive- 
minus-Debilitative scales on each of the three forms of the Achievement Anxiety Test 
are presented in Table 2. The reliability coefficients ranged from .65 to .76 on 
the Facilitative scale, from .76 to .86 on the Debilitative scale, and from .83 to 
.88 on the Facil i tative-minus-Debilitative scale. With respect to each of the three 
fonr.s of the Achievement Anxiety Test, the reliability of the Facilitat ive-minus- 
Debilitative scale was higher than the reliability for either the Facilitative scale 
or the Debilitative scale. This increase in the reliability of the difference scores 
is attributable to a negative correlation between the Facilitative scores and the 
Debilitative scores: -.66 for Form t'/I, -.42 for Form #2, and -.60 for Form #3. 

When compared statistically, significant differences were shown to exist among these 
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three correlation coefficients ( X = 6.001, df = 2, p <.05). 



Insert Table 2 about here 



DISCUSSION 

The results of this research investigation demonstrate that sometimes oven 
seemingly minor variations of the original Alpert-Haber Achievement Anxiety Test 
can yield scores having different means, reliability estimates, anJ intra-form cor- 
relations between the Facilitative and Debilitative scores. In addition to the find- 
ing of differences among forms, this study reveals that the negative correlation 
between the Facilitative and Debilitative scales within any one form of the Achieve- 
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ment Anxiety Test cause s the reliability ot the racilitr.iive-:. inus-I)ebi litativc 
scale to be higher than the reliability vi tr.e T.-'c i 1 1 l i : : vc or {ho DebilitnLive 
scale. Tliis finding su^;i;ests that the Facilitative-r.inus-Debilitative scale is 
the best over-all neasure of test anxiety instead of the Facilitativc cr Debilita- 
tive scale as sonc authors have ir.plied. 

Two comments concerning the research design used in this study are \:orth 
noting. First, the three forns of the Achiever;.ent .-\nxjety Test varied along two 
dimensions, response forniat and inclusion vs. oziission of tho buffer iteir.s. The 
potential confounding between these two variables could have been avoided through 
the use of a 2x3 factorial design. However, such a design would have necessitated 
the const^'uc tion of three new forms of the .Achievement .•\nxiety Test. Since the 
only purpose of this study was to cor:pare the three forr.s of the Achievement Anxiety 
Test that have been used in published research investigations, a one-factor design 
was employed, .\lthough the authors were willing to find differences without know- 
ing the specific cause, the observed differences between the two revised versions of 
the Achievement Anxier./ Test, neither of which had buffer items, clearly indicate 
that the factor of response format, by itself, has an effect on the instrument. 

The second comment related to the design concerns the use of the self rating 
as the covariate. The present authors recognize that there might be differences 
between the v:ay individuals respond to the Achievem.ent Anxiety Test depending upon 
whether or not they are first asV:ed to rate themselves cn test anxiety. Thus, the 
results obtained in this ';tudy with respect to any particular form, of the Achieve- 
ment Anxiety Test m\^,ht not be replicable with a simila: group of S^s who would be 
asked to complete just the Achievement Anxiety Test and not the self rating. In 
spite of this possibility, the administration of the self taring prior to the Achieve- 
ment Anxiety Test in the present study cannot be considered as a plausible rival 

hypothesis for the observed differt-nces between the three forms of the Achievement 

«> 
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Anxicity Test. Since all Ss were expor.cd to the soif rating first, a sensitizing 
effect — if there was one — was controlled. 

In conclusion, the differential results obtained with three variations of 
the same instrument in this investigaLion do not indicate that slight alterations 
in an instrument will always create a different score distribution. The results 
do, however, argue for comprehensive description of variables in a study to eliminate 
error in replication, and also for an indication of any changes that have been made 
to the measuring instruments used within the study. 
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